MaxGNR: A Dynamic Weight Strategy via Maximizing Gradient-to-Noise Ratio for Multi-task Learning
نویسندگان
چکیده
When modeling related tasks in computer vision, Multi-Task Learning (MTL) can outperform Single-Task (STL) due to its ability capture intrinsic relatedness among tasks. However, MTL may encounter the insufficient training problem, i.e., some non-optimal situation compared with STL. A series of studies point out that too much gradient noise would lead performance degradation STL, however, scenario, Inter-Task Gradient Noise (ITGN) is an additional source for each task, which also affect optimization process. In this paper, we ITGN as a key factor leading problem. We define Gradient-to-Noise Ratio (GNR) measure relative magnitude and design MaxGNR algorithm alleviate interference task by maximizing GNR task. carefully evaluate our on two standard image datasets: NYUv2 Cityscapes. The results show outperforms baselines under identical experimental conditions.
منابع مشابه
Learning Task Allocation via Multi-level Policy Gradient Algorithm with Dynamic Learning Rate
Task allocation is the process of assigning tasks to appropriate resources. To achieve scalability, it is common to use a network of agents (also called mediators) that handles task allocation. This work proposes a novel multi-level policy gradient algorithm to solve the local decision problem at each mediator agent. The higher level policy stochastically chooses a task decomposition. The lower...
متن کاملOnline Multi-Task Gradient Temporal-Difference Learning
Reinforcement learning (RL) is an essential tool in designing autonomous systems, yet RL agents often require extensive experience to achieve optimal behavior. This problem is compounded when an RL agent is required to learn policies for different tasks within the same environment or across multiple environments. In such situations, learning task models jointly rather than independently can sig...
متن کاملMetric Learning via Maximizing the Lipschitz Margin Ratio
In this paper, we propose the Lipschitz margin ratio and a new metric learning framework for classification through maximizing the ratio. This framework enables the integration of both the inter-class margin and the intra-class dispersion, as well as the enhancement of the generalization ability of a classifier. To introduce the Lipschitz margin ratio and its associated learning bound, we elabo...
متن کاملOnline Multi-Task Learning for Policy Gradient Methods
Policy gradient algorithms have shown considerable recent success in solving high-dimensional sequential decision making tasks, particularly in robotics. However, these methods often require extensive experience in a domain to achieve high performance. To make agents more sampleefficient, we developed a multi-task policy gradient method to learn decision making tasks consecutively, transferring...
متن کاملImage Cosegmentation via Multi-task Learning
Image segmentation has been studied in computer vision for many years and yet it remains a challenging task. One major difficulty arises from the diversity of the foreground, which often results in ambiguity of backgroundforeground separation, especially when prior knowledge is missing. To overcome this difficulty, cosegmentation methods were proposed, where a set of images sharing some common ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2023
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-26319-4_31